Ranking Invariance Based on Similarity Measures in Document Retrieval

نویسندگان

Jean-François Omhover

Maria Rifqi

Marcin Detyniecki

چکیده

To automatically retrieve documents or images from a database, retrieval systems use similarity measures to compare a request based on features extracted from the documents. As a result, documents are ordered in a list by decreasing correspondance to the request. Several comparison measures are used in the field and it is difficult to choose one or another. In this paper, we show that they can be grouped into classes of equivalent behavior. Then, in a query by example process, the choice of these measure can be reduced to the choice of a family of them.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology based Similarity Measure in Document Ranking

This paper presents a methodology for the ontology based semantic annotation of web pages with annotation weighting scheme that takes advantage of the different relevance of structured document fields. The retrieval model is based on the importance factors of the structural elements, which are used to re-rank the documents retrieval by the ontology based distance measure. The relevance concept ...

متن کامل

Utilizing Passage-Based Language Models for Document Retrieval

We show that several previously proposed passage-based document ranking principles, along with some new ones, can be derived from the same probabilistic model. We use language models to instantiate specific algorithms, and propose a passage language model that integrates information from the ambient document to an extent controlled by the estimated document homogeneity. Several document-homogen...

متن کامل

Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

Background and Aim: this research investigates the impact of authors’ rank in Bibliographic networks on document-centered model of Expertise Retrieval. Its purpose is to find out what kind of authors’ ranking in bibliographic networks can improve the performance of document-centered model. Methodology: Current research is an experimental one. To operationalize research goals, a new test colle...

متن کامل

بررسی نقش انواع بافتار هم‌نویسه‌ها در تعیین شباهت بین مدارک

Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation and thereby improving the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts could be divided to different types, wit...

متن کامل

Semantic Search: Document Ranking and Clustering Using Computer Science Ontology and N-Grams

Semantic similarity has become an important tool and widely been used to solve traditional Information Retrieval problems. This study adopts ontology of computer science and proposes an ontology indexing weight based on Wu and Palmer’s edge counting measure and uses the N-grams method for computing a family of word similarity. The study also compares the subsumption weight between Hliaoutakis a...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Ranking Invariance Based on Similarity Measures in Document Retrieval

نویسندگان

چکیده

منابع مشابه

Ontology based Similarity Measure in Document Ranking

Utilizing Passage-Based Language Models for Document Retrieval

Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

بررسی نقش انواع بافتار هم‌نویسه‌ها در تعیین شباهت بین مدارک

Semantic Search: Document Ranking and Clustering Using Computer Science Ontology and N-Grams

عنوان ژورنال:

اشتراک گذاری